Learning Visual Symbols for Parsing Human Poses in Images
نویسندگان
چکیده
Parsing human poses in images is fundamental in extracting critical visual information for artificial intelligent agents. Our goal is to learn selfcontained body part representations from images, which we call visual symbols, and their symbolwise geometric contexts in this parsing process. Each symbol is individually learned by categorizing visual features leveraged by geometric information. In the categorization, we use Latent Support Vector Machine followed by an efficient cross validation procedure. Then, these symbols naturally define geometric contexts of body parts in a fine granularity. When the structure of the compositional parts is a tree, we derive an efficient approach to estimating human poses in images. Experiments on two large datasets suggest our approach outperforms state of the art methods.
منابع مشابه
Investigating common visual symbols in photography of the Islamic Revolution based on Pearce's pattern
The Islamic revolution, has remaining very influential on the way people lived, changed the path of art and especially the photography of Iran. There are many photographs of the Revolution. By examining the visual cues, the changes in the cultural and artistic trends of that period can be better understood. Are there any shared signs between the images in the photograph and do they share certai...
متن کاملLearning to parse images of articulated bodies
We consider the machine vision task of pose estimation from static images, specifically for the case of articulated objects. This problem is hard because of the large number of degrees of freedom to be estimated. Following a established line of research, pose estimation is framed as inference in a probabilistic model. In our experience however, the success of many approaches often lie in the po...
متن کاملVirtual to Real Reinforcement Learning for Autonomous Driving
Reinforcement learning is considered as a promising direction for driving policy learning. However, training autonomous driving vehicle with reinforcement learning in real environment involves non-affordable trial-and-error. It is more desirable to first train in a virtual environment and then transfer to the real environment. In this paper, we propose a novel realistic translation network to m...
متن کاملDiscriminative Hierarchical Part-Based Models for Human Parsing and Action Recognition
We consider the problem of parsing human poses and recognizing their actions in static images with part-based models. Most previous work in part-based models only considers rigid parts (e.g., torso, head, half limbs) guided by human anatomy. We argue that this representation of parts is not necessarily appropriate. In this paper, we introduce hierarchical poselets—a new representation for model...
متن کاملTimescales for Sparseness of Natural Sound: Implications for Auditory-Symbols Processing
The statistical structure of the natural visual environment influences the strategies for sensory processing in the retina and the thalamus of primates, where second-order redundancy is reduced, and in the primary visual areas of cortex, where filters with high output-kurtosis, which expose the “sparseness” of the visual environment, are believed to facilitate scene parsing and object segmentat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013